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ABSTRACT 


In order to determine the Fourier transform of a quasi-per iodic time 
series (linear problem), or the power spectrum of a stationary random 
time series (quadratic problem) , it is desirable that data be recorded 
without interruption over a long time interval. In practice, this may 
not be possible. The effect of regular interruption such as the day/ 
night cycle is well known. We here investigate eho effect of irregular 
Interruption of data collection (the "breaking" of the window function) 
with the simplifying assumption that there is a uniform probability p 
that each interval of length r , of the total interval of length T - Nt, 
yields no data. 

For the linear case we find that the noise-to-signal ratio will 

-1 -> 

have a (one-sigma) value less than t' if N exceeds p (l-p)e 
For the quadratic case, the same requirement is met by the less restric- 
tive requirement that N exceed p *(l-p)e ^ . 

It appears that, if four observatories spaced around the earth were 
to operate for .5 days, each for six hours a day (N • 100), and if the 
probability of cloud cover at any site on any day Is 20 %. Ip ■ 0.8), 
the r.m.s. noise-to-signal ratio is 0.25X for frequencies displaced 
from a sharp strong signal by 15 uHz. The noise-to-signal ratio drops 
off rapidly if the frequency offset exceeds 15 pHz. 


EXAMINATION OF TIME SERIES THROUGH RANDOMLY BROKEN WINDOWS 


I. INTRODUCTION 

In many astrophysical problems one is concerned with the study of 
time series. It often happens that the property of particular Interest 
is the spectrum of the time series. In principle, one may determine a 
time series to a prescribed accuracy by making measurements, without 
interruption, over a sufficiently long time interval. In practice, the 
length of time over which the variables may be measured will be limited. 
Moreover, measurements may necessarily be interrupted (or otherwise impaired) 
for one reason or another. The relationship of the spectrum determined 
by limited, interrupted measurements to the intrinsic spectrum has been 
the subject of many investigations, as recently reviewed by Deeming (1975). 

If the original time series is denoted by x(t), one may regard the 
measurements y(t) as being determined by 

y (t) - f(t)x(t), (1.1) 


where f(t) Is the "window function." We regard x, v and f as being simple 
scalar functions but the procedure may be generalized to replace x, y by 
vectors and t by a tensor. 

We use the Fourier transform notation 


x(t) 



-iidt 

e 


x(ut) 


x ( at) 



x ( t ) 


( 1 . 2 ) 

( 1 . 2 ) 


where the limits of integration are to he taken to be - to + > v if other 
limits are not explicitly specified. 
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If we are interested in determining x(oj), the Fourier transform of 
the time series x(t), then we may use the relation 

y(co) ■ Jdui' i (to 1 ) x(o) - a)’ ) (1.4) 

to relate the Fourier transform of the measured time series y(t) to that 
of the original time series x(t). It follows from this experience that if 
f (.o') has appreciable amplitude at frequencies far removed from w'-o, 
for exar.pl e if it has a subsidi v,* peak at the measured time 

series \ (a>) will then reflect the behavior of the actual series not 
only at frequency u.' but also at u.v+t' 0 . The frequencies w and u> Q thus act 
as aliases of one another, and the associated distortion of the 
measured spectrum is referred to as aliasing. 

We are interested in the possibility that f(t) may be regarded as a 
random variable, expressible as 


f(t) - 


F(t;a .a, 


•V 


n; 


■l F n 


(t. ^) 


(1.5) 


where are independent random variables with a common expee- 

tation distribution. By the central limit theorem (Papoulia, 19b5), we 
expect that the random variable f (or its Fourier transform f) will have 
a distribution approximately Gaussian in form if N is not a small number, 
so that an adequate representation of f would be given by Its mean <f> 
and Its standard deviation o(f). 


If, on the other hand, x(t) is a random time series, we will be 


concerned with the autocorrelation function R^(t\ defined by 


R x (t) - <x(t*) x(t'+t)>. 


( 1 . 6 ) 


and the power spectrum of the time series, defined as the Fourier transform 


of R (t); 
x 




-ioit 

R (t) - /do) e S (cu ’ , 

X f X 


s (<o) 

x 




dte lu)C R x (t). 


( 1 . 7 ) 


( 1 . 8 ) 


On noting that 


< x(ut) X (o)*)^> ■ S (ul) 6 (ul + u) ' ) 


( 1 . 9 ) 


and evaluating <y(co) y (o)*)> , we may verify that 


S (oi) - Idai' w(ui') S (u> - at’ 

y J 


( 1 . 10 ) 


where 


W (u>) • f (u>) f (- O) . 


( 1 . 11 ) 


Clearly the function w( u i) represents the capability of the tneasuremen 
process, described by the "window function" f(t), to determine the power 
spectrum S^fic). The function w(u>) may be expressed in terms of the inde- 
pendent random variables 

w(u>) - W(u'; 0 ^, 0 ,, . . . ,n N ) . (1.12) 

Once again, unless N is a small number, we expect that the distribution 
of w will be approximately Gaussian so that it may be characterized by 


3 


Its mean value <w> and standard deviation cr(w). 

This article was prompted by a problem related to the determination 
of normal modes of oscillation of the sun, as determined by measurement of 
the photospheric velocity field. Measurements have been presented by 
Deubner (1975) and by Rhodes et al. (1977), and their theoretical inter- 
pretation discussed by Ulrich and Rhodes (1977) and by Ulrich et al. (1978). 
For optimum determination of the power spectrum of the velocity field 
(expressed as a function of wave number), it is clearly desirable to make 
observations without interpretation over as long an interval as possible. 
Away from polar regions, observations from a single station are interrupted 
by the day-night cycle which leads to unacceptable aliasing of the data. 
Observations made from a spacecraft in polar orbit would obviously yield 
un-aliased data of higher quality and higher frequency resolution. Obser- 
vations made from the south pole during austral midsummer can lead to 
several days of uninterrupted observation and to still longer intervals 
with occasional, irregular interruption. It is also possible to select 
three or four stations around the earth which, in the absence of any cloud 
cover, could give continual coverage of the sun for many weeks. However, 
one must anticipate that some of the data would be lost by cloud cover. 

It is clearly desirable that one should be able to make some esti- 
mate of the accuracy with which oscillation modes may be determined when 
it appears possible to observe the sun over a long interval of time losing 
some blocks of time because of cloud cover. The purpose of this article 
is to develop a model which enables us to address problems of this type. 


After presenting a few general formulas, we shall simplify the problem 
considerably by supposing that observations are made over a large number 
N of equal time intervals, each of length T , so that the total time 
interval T is given by 

T - Nx . (1.13) 

With certain additional simplifying assumptions, we shall consider the 
statistical properties of the functions f(w) and w(u>) which are repre- 
sentative of "randomly broken" window functions. 

II. MATHEMATICAL MODEL 

in the case that the window function f(t) is expressible in the form 
(1.5), in terms of a number of random variables, we wish to study the distri- 
bution of the functions f (o>) , w(oj) , entering equations (1.4) and (1.10). 

We suppose that the distribution of the variables ct to a N is given by 
the probability function P (ct^ , ..., a^) such that P(a^, ...» c* N )da^ ..., da^ 
is the probability of finding ot^ in the range to a^+da^, etc. Then the 
expectation value of the quantity f (u>) is given by 

<f(u>)> - jd\ P N (a) F(w;a 1 ,...,a N ), (2.1) 

N 

where d‘ a denotes dot, .... da . and P„(a) denotes P(a., a ). If we 

INN L N 

use the following notation for the variance of a complex variable of a 
complex variable z, 

o“(z) - 0 2 ( z f ) + o'Cz^, (2.2) 

where z^ and z^ are the real and imaginary parts of z, then noting that 
f (- at) is the complex conjugate of f (u>) , we see that 


5 


d 2 (f <o>)) - <f(o)) f (-<o)> -<f(ai)> <f (-<*))> . 


(2.3) 


The first term on the right-hand side may be evaluated from 


<w(0))> s<f(a>) f (-0)» 



P N (a)P(o);a lt . . . ,a N ) F (- w; 



(2.4) 


We see that equation (2.4) also gives the expectation value of the 
"window spectrum" w(w) which appears In equation (1.10) and is appropriate 
for the discussion of stationary random time series. The variance of this 
function is given by 


o (w) ■ <w(co) w (- co)> - <w(o>)> <w(-w)> 


(2.5) 


where 


^w((ii) w(- (i))^ ■ aP^j(a) j F(ai;cx^, • . . , 0 ^ j | F (— a> , ot^, ...,a^)j . (2.6) 

As indicated in the introduction, we intend to consider the case that 
the observing time to t^, of length T, is divided into N equal intervals 
bounded by times t^, t£ f ••• where 


Is. 


t ■ t.+nT 
n 0 


(2.7) 


so that we may adopt the form 

N 

°n | h<t - t „-i ; - h(t - t n ) 


F(t; 


( 2 . 8 ) 


n * 1 


where h(t) is the Heavyside function: 

h(t) - 0 if t < 0, j 
- 1 it t > 0 . ^ 

We also assume that the Intervals are statistically independent, so .hat 


(2.9) 
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we may write 

P N (a)d N a = P(a lf ..., a^dc^ dc^ - |^(a x ) da iJ* • *^N (a N )da N j * (2 * 10) 

If we assve that, for each interval, there is a (uniform) probability p 
that the window is open and probability 1-p that it is closed, then 

P(a n ) - p 6(a n -l) + (l-p)6(a n ) . (2.11) 

In evaluating (f(o))), given by (2.1), we will use 

<<!„> - p . (2.12) 

In evaluating the quantity given by equation (2.4), we will need r.o 
evaluate ^ot m aV which is clearly given by if n but by p if in « n. 

Hence 

<a m a n >- P 2 + ( P'P 2) 5 mn’ (2J3) 

ra n an 

where 6^ is the Kronecker function. In evaluating the quantity given 

by (2.6), we need to evaluate the expectation value of a a a a . By con- 

m n p q 

sidcring the various possibilities (m,n,p,q all different; two of them 
the same, etc.) we find that 


3 4 , 


<ja a a a_>-(p + p -p ) (5 + 6 + 

^mnp<r K mn mp 


5 + 6 + 6 ) 

nn nq pq 

+ (p 2 -2p 3 +p^) (66 +66 +66, 

F y v mn pq mn nq mq np) 

+ (p^-lp^+Zp^) (6 +6 +6 +6 

K r npq mpq mnq mnp) 

(p-7p'+12p 3 -6p ) 6 
r ^ r mnpq 


(2.14) 


where 6 -lifm-n-p otherwise 0, and 6 is defined similarly, 

mnp mnpq 
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III. EVALUATION OF MODEL 

For simple (non-random) time series, equation (1.4) gives the relation- 
ship between the Fourier transforms of the original and measured time series. 
In this context, tha properties of the random window function f(t) may be 
characterized by ^f(oi)^ and o 2 (f) . 

On substituting the fora (2.8) into (1.5), we find that 


V - to i • ln (H ,l “l t ° + < "'^ • 

n - 1 ' ' 


On using (2.1) and (2.12), we obtain 


<f(w)> - T p sinc(y 0 ) T) e 


where sine 6 ■ 8 ^ sin 9. 

On using (2.4), we f’nd that 


2 ltt(t o + S ) 


<w( 0 ))>- (~) 2 T 2 


p 2 8inc 2 (ycuT) + N 1 p(l-P)»inc 2 (ya)T) 

TO 


Hence, using (2.3), we obtain 


o 2 (f) - 2 T 2 N -1 p ( 1 - p) sinc 2 ^ cut). 


(3.1) 


(3.2) 


(3.3) 


(3.4) 


For evaluating the effects of "breaking" of the window function, it 
is convenient to normalize the standard deviation with respect to the maxi- 
mum value of f (u), which is the value at cu • 0. Accordingly, we introduce 
the definition 


.. x X 0(f(o>)l 

r i ( “> * Tm> 


(3.5) 


For the case under consideration, this has the form 


^(cu) - N 


- 1/2 - 1/2 . 1/2 | . 1 . 
p (1 - p) | sine — u)T j . 


(3.6) 
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For discussion of the properties of randomly broken windows in Che 
study of stationary random time series, it is necessary to evaluate the 
mean value and atandard deviation of w (u). The former is given by equa- 
tion (3.3). The first term inside the brackets has the same form as arlsaa 
in the non- random case (p ■ 1) . The second term represents a change in the 
mean spectrum, so it is convenient to introduce the symbol for the 

ratio of the additional term to the maximum value of the principal term: 

A 2 - N -1 ?" 1 (1-p) sine 2 (^<ut) . (3.7) 

On writing equation (2.6) in the simpler form 

<w(w)w(-«)> - <F(u;a)F(<o;o)F(- u; a)F(-<o; a)> (3.8) 


and using equations (1.3) and (2.8), we see that 


<w(u>)w(-co)> 


y < a > U * U * 
t—> ' mnpq m n 


(2^)“ me, ' ” pq 


iu)t itot 
m 


where U (cj) ■ e - e 

IQ 


m_1 and U *(ai) - U (-oi) 


U U . (3-9) 

P 9 

is the complex conjugate 


of U . 
m 

On using equation (2.14), ve see that this may be expressed in the form 


<w(u))w(-w) > - — - — , !p^E. + (p^"P^)( 2E -> + 

(2^) 

+ 4 (p 2 - 3p 3 + 2pS E 4 (3.10) 


+ (p - 7p 2 + 1 2 p 3 - 6pS E 5 
+ (.p“ - 2 p 3 *■ p^) (E^ + 2 E 7 ' | . 

where 
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(? "’)(? ; "' 2 ) 

? I u »r • 


(?N 2 ) 2 


On evaluating these sums 

, we find that 

E l“ 

lb jin^|y ut| , 

E 2- 

16 sin 2 (j cjt| sii 

E 3 

16 N sin 2 ||wl) : 

E * ' 

16 sin 2 |-| cousin 

E 5 “ 

16 N sin 4 (~ «t) 

E * 

9 

“ •‘■‘(rH* 

E, - 

2 4 / v 

16 N sin / wt | 




for 


(That E 2 and are real Is due to the fact that 
all m.) 


U 

m 



Hence equation (3.10) is found to be expressible as 


<w(co) w (-u>)> - Jp A sinc 4 |^lj 


+ 2 p 2 (l-p ) sine / cot\ sine / wx_|2 + sine (cuT ) + 2^1-2p) 
N \2 ) \ 2 sine ojt N 


+ sine 4 

N 


(f )[««( 


1 2 + if fS 


6p - 6p + 1 

2 

N 


On using equations (2.3), (3.3) and (3.13) and the definition 

a (w(co)) 


(3.13) 


(3.14) 


we find that 


£ ^ Cco ) 


2 (1-p ) sine 
NP 


(if) sl “ 2 ( f-) 


1 + sine (cuT ) + 2(l-2p) 
sinc(jJT) Np 


/lz£\ 

sine 4 

1-p / l ( sinc 2 (coT)\ ( 6p 2 -6p + 1 

\NP / 


^ ' sinc 2 (a3t)^ ( Np) 2 


(3.15) 
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IV. DISCUSSION 


We see from the preceding section that the effect of a random "break- 
ing" of the window function Is to produce an aliasing of any signal. This 
effect is described by the function E^ or by A^ and E^ as given by equa- 
tions (3.6), (3.7) and (3.15). 

For the "linear" problem, as described by equation (1.4) the mean 
Fourier transform of the window function, as given by equation (3.2)* 
has the same form as it does in the non-random case, although it is 
reduced by a factor p. The standard deviation is characterized by E^, 
defined by equation (3.5) and given by equation (3.6). 

It is convenient to introduce the notation 


«> T - 2T -1 , u> T • 2t -1 . 

On noting that sine 0^1 for 0 ,< 1, and that |slnc 0) 
we see that ^ where 



(4.1) 



for 0^1, 



The above equations suggest that the aliasing is most severe 

within the range of a few Limes of the strongest signal. It is 

l 

informative to compare S^(m) with C^(ui) defined by 


< fju£> 

<f(o» 






1 , 


1) s u' 

T 


» u' > III 

- T, 


(4.3) 
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which represents the normalised envelope of the "core" of the window 
function, which has the "ideal" form which obtains in the absence of 
breaking. The variation of both C^(w) and S^(oj) with 0 ) is shown 
schematically, on a logarithmic scale, in Figure 1. 

We see from this diagram that the random interruption of data 
collection adds a "wing" to the core of the window function. This 
wing has a height J" fl-p)/Npl over the range &> c to where 



provided that N > p/(l-p) . 

We may readily calculate the minimum number of intervals 
necessary to ensure that 7 .^ is below an assigned level e for a given 
value of p. We see from (4.2) that S^S £ if N i where 


N 


1 



(4.5) 


If, for instance, p ■ 0.8 and we require that 0.05, N must be at 

least 100. 

For the quadratic problem in which we are determining the spectrum of 
a stationary random time series, it is convenient, as in our discussion of the 
linear case. to Introduce C.,( a 0, the normalized envelope of the "core" 
of the window function for the case of uninterrmpted data collection. 

As we see from equation (3.3), this function is given approximately by 
the formula 


i',(uO “1. ul.-O) , 
T * 


I 


t: ) • 


(■i.bi 
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The aliasing is now described by the func ions A 2 (<i>) and I 2 (w) given 
by equations (3.7) and (3.15). We find that the envelope of A 2 (gj) is 
bounded by the function D 2 (oj) given by 


D 2 (u>) - 



(4.7) 


The effect of D 2 (a>) is to produce a "wing” to the window function, 
extending out from a)“w c » where gj c is given by equation (4~4 ) , as shown 
in Figure 3. 

Discussion of the function £ 2 (os), representing the normalized standard 
deviation of the window function from the mean form given by equation 
(3.3), is somewhat more complicated. However, if Np»l, we find from 
equation (3.15) that r^( w ) ^ (<o) where 



where 



| (4.9) 
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The functions ant * ^ 2 ( 0 ^) are shown plotted in Figure 2 for 

the particular case N * 100, p“ 0.8 and T ■ 6 hours (a case discussed 
later in this section). We see from this figure that 02 ( 0 )) provides 
an excellent approximation to the envelope of • 

The function (the normalized envelope of ti.e standard deviation 

of the window function) is also shown schematically in Figure ?. We see 
that S 2 (w) “ D 2 (to) for co>co d . We also see that S 2 (a>) < C 2 (cu) for 



and we note that D 0 (ai) C (ui) also in this range. The frequencies 

a). , io , u)., are in the proportion 
bed 


“b : 


O) 

c 


: co 

c 



(A. 11) 


We may now calculate the minimum number ^ of intervals necessary 

to ensure that S»(oj) [ and hence Z„(co) and A _ (co) ] is below an assigned 
l i. i 

level e for a given value of p. Ignoring the small region between 

to. and u)., where the condition is a little more restrictive, we see that 
b d 

the required condition is satisfied if 

N > N , “ . ( A . 1 2 ) 

2 pC 

We see from (A. 5) and (A. 12) that N 0 is smaller than by the factor 
c. Hence aliasing is likely to be less serious in the quadratic case than 


it is in the linear case. 


In order to assess the implications of the present model concerning 
ground-based observations of solar oscillations, one will need to have 
detailed estimates of the expected spectrum (in particular, the spacing 
and relative power of nearby lines) and the expected cloud cover at three 
or four observatories postioned round the world. It is also desirable that 
the present model should Le extended by considering separate values of 
p for each of the observatories, and possibly by taking into account 
the correlation between cloud cover on consecutive days. 

Nevertheless, we can illustrate the results of this model by consider- 
ing a hypothetical situation. Suppose that four observatories are located 
around the world in such a way as to give continous coverage (in the 
absence of cloud cover) , and that these observatories are operated for 25 
days. Then N = 100. Suppose that, for any observatory on any day, there 
is a 20% probability of cloud cover so that p = 0.80. We find from (4.1) 
that (using v * oj/2tt), v^. = 15pHz. For frquencies less than this value, 
(4.7) and (4.8) show that aliasing amounts to 0.25% or less. For 
frequencies above 15uHz, the aliasing drops off rapidly. 

Although it will be necessary to make more detailed and specific 
calculations to draw definite conclusions, it appears from the above 
simple example that it may be possible to carry out high-quality studies 
of solar oscillations from a chain of ground-based observatories. 

This work was supported in part by NASA Grant NGL 05-020-272, Office 
of Naval Research Contract N00014-75-0673 , and the Max C. Fleischmann 
Foundation . 
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CAPTIONS 


Figure 1 . Schematic diagram showing the relationship between S^(w) , 

the normalized envelope of the standard deviation, and C^(oi) , 
the normalized "core" function, for the linear case. The 
scales of both axes are logarithmic. 

Figure 2 . Comparison of the normalized standard deviation and 

the approximate envelope function S2(w) for the special case 
discussed in the text. 

Figure 3 . Schematic diagram showing the relationship between S2 (00) , the 
normalized envelope of the standard deviation, D ^ (w) , the 
normalized envelope of the mean displacement, and C2(u) , the 
normalized core function, for the quadratic case. The scales of 
both axes are logarithmic. 
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